Fei Huang

SIRI: Self-Internalizing Reinforcement Learning with Intrinsic Skills for LLM Agent Training

Jun 01, 2026
Via arXiv

XDomainBench: Diagnosing Reasoning Collapse in High-Dimensional Scientific Knowledge Composition

May 14, 2026
Via arXiv

Entropy Polarity in Reinforcement Fine-Tuning: Direction, Asymmetry, and Control

May 14, 2026
Via arXiv

Qwen-Scope: Turning Sparse Features into Development Tools for Large Language Models

May 12, 2026
Via arXiv

OccuBench: Evaluating AI Agents on Real-World Professional Tasks via Language World Models

Apr 13, 2026
Via arXiv

MPU: Towards Secure and Privacy-Preserving Knowledge Unlearning for Large Language Models

Feb 27, 2026
Via arXiv

WebWorld: A Large-Scale World Model for Web Agent Training

Feb 16, 2026
Via arXiv

P-GenRM: Personalized Generative Reward Model with Test-time User-based Scaling

Feb 12, 2026
Via arXiv

Outcome Accuracy is Not Enough: Aligning the Reasoning Process of Reward Models

Feb 04, 2026
Via arXiv

CorpusQA: A 10 Million Token Benchmark for Corpus-Level Analysis and Reasoning

Jan 21, 2026
Via arXiv